Home > Computers & Technology > Networking & Cloud Computing

Handbook of Research on Big Data and the IoT by Kaur Gurjit

Author:Kaur Gurjit , Date: August 15, 2020 ,Views: 100

Handbook of Research on Big Data and the IoT by Kaur Gurjit

Author:Kaur Gurjit
Language: eng
Format: epub
Publisher: Engineering Science Reference

4.1.3 Data Storage

HDFS (Hadoop distributed file system), S3 (Simple storage services)

Servers: EC2, Google App Engine, Elastic, Beanstalk, Heroku

4.1.4 Data Processing

R, Yahoo! Pipes, Mechanical Turk, Solr/Lucene, ElasticSearch, Datameer, BigSheets, Tinkerpop

We now examine two of the most popular Big Data processing frameworks, MapReduce and Hadoop, in detail.

4.2. MapReduce

It is a data processing computational framework applied to large datasets by employing distributed algorithms on clusters. This framework comprises user-defined Map and Reduce functions as well as a MapReduce library. Data is processed in parallel using map functions, whose output is sorted and processed by reducing functions. The MapReduce library parallelizes the data processing by breaking it down into smaller chunks that are processed using a master/slave implementation. Typically, the MapReduce framework is implemented in six steps as follows.

Step 1: Read data value from the Hadoop Distributed File Systems (HDFS).

Step 2: Split the task into small tasks.

Step 3: Input key/value pairs to Map function to generate intermediate key/value pairs.

Step 4: From the output of the Map function, identify and send all pairs with the same key to the Reduce function.

Step 5: Sort the input to the reduce function by key.

Step 6: Write the reduced output into the HDFS.

Download

Handbook of Research on Big Data and the IoT by Kaur Gurjit.epub

Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.

Categories

Linux & Unix	iPhone & iOS
Macintosh	Android
Business Technology	Certification
Computer Science	Databases & Big Data
Digital Audio, Video & Photography	Games & Strategy Guides
Graphics & Design	Hardware & DIY
History & Culture	Internet & Social Media
Mobile Phones, Tablets & E-Readers	Networking & Cloud Computing
Operating Systems	Programming
Programming Languages	Security & Encryption
Software	Web Development & Design

Popular ebooks

The Mikado Method by Ola Ellnestam Daniel Brolund(22542)
Kotlin in Action by Dmitry Jemerov(19348)
Grails in Action by Glen Smith Peter Ledbrook(16801)
Sass and Compass in Action by Wynn Netherland Nathan Weizenbaum Chris Eppstein Brandon Mathis(14285)
Configuring Windows Server Hybrid Advanced Services Exam Ref AZ-801 by Chris Gill(7521)
Azure Containers Explained by Wesley Haakman & Richard Hooper(7515)
Running Windows Containers on AWS by Marcio Morales(7067)
Microsoft 365 Identity and Services Exam Guide MS-100 by Aaron Guilmette(5451)
Microsoft Cybersecurity Architect Exam Ref SC-100 by Dwayne Natwick(5291)
Combating Crime on the Dark Web by Nearchos Nearchou(5044)
The Ruby Workshop by Akshat Paul  Peter Philips  Dániel Szabó  and Cheyne Wallace(4720)
Management Strategies for the Cloud Revolution: How Cloud Computing Is Transforming Business and Why You Can't Afford to Be Left Behind by Charles Babcock(4563)
Python for Security and Networking - Third Edition by José Manuel Ortega(4296)
The Age of Surveillance Capitalism by Shoshana Zuboff(4275)
Learn Windows PowerShell in a Month of Lunches by Don Jones(4192)
Learn Wireshark by Lisa Bock(4192)
Ember.js in Action by Joachim Haagen Skeie(4074)
The Ultimate Docker Container Book by Schenker Gabriel N.;(3938)
DevSecOps in Practice with VMware Tanzu by Parth Pandit & Robert Hardt(3628)
Windows Ransomware Detection and Protection by Marius Sandbu(3599)